Goto

Collaborating Authors

 find solution



The Marginal Value of Adaptive Gradient Methods in Machine Learning

Neural Information Processing Systems

Adaptive optimization methods, which perform local optimization with a metric constructed from the history of iterates, are becoming increasingly popular for training deep neural networks. Examples include AdaGrad, RMSProp, and Adam. We show that for simple overparameterized problems, adaptive methods often find drastically different solutions than gradient descent (GD) or stochastic gradient descent (SGD). We construct an illustrative binary classification problem where the data is linearly separable, GD and SGD achieve zero test error, and AdaGrad, Adam, and RMSProp attain test errors arbitrarily close to half. We additionally study the empirical generalization capability of adaptive methods on several stateof-the-art deep learning models. We observe that the solutions found by adaptive methods generalize worse (often significantly worse) than SGD, even when these solutions have better training performance. These results suggest that practitioners should reconsider the use of adaptive methods to train neural networks.


How Machine Learning Can Help Fight Climate Crisis and Global Warming โ€“ A blog article automatically written by ChatGPT

#artificialintelligence

As the effects of climate change become more apparent, there is an urgent need to find solutions that can help to reduce greenhouse gas emissions and slow the pace of global warming. One technology that is gaining attention in this effort is machine learning, which has the potential to help us understand and address some of the most pressing challenges facing our planet. One of the key ways that machine learning can help in the fight against climate change is by providing us with insights and predictions that can inform decision making and policy. For example, machine learning algorithms can be used to analyze data on global temperatures, atmospheric carbon dioxide levels, and other environmental indicators, providing us with a better understanding of the current state of the planet and the likely impacts of different actions. Machine learning can also be used to optimize processes and systems in ways that reduce greenhouse gas emissions.


Feeding the world by AI, machine learning and the cloud

#artificialintelligence

The answer to this challenge, according to Thomas Jung, head of IT Research and Development at Syngenta, is regenerative agriculture. Just as important as clean water and clean air, soil is the critical foundation of agriculture. The crux of regenerative agriculture is to grow more food with less environmental impact by enhancing the health of soil. "So not much has changed, but we need to feed more and more people," he continues "How do we address this challenge of feeding the world in a sustainable fashion without exploiting our soils more?" Regenerative agriculture efforts look to find solutions to help plants stay healthy, find solutions to make crops more resistant to climate change-induced droughts and heatwaves, and use less water in farming. Therefore, what's necessary is, "moving beyond the traditional agriculture and the way we've been doing this for probably 100 years or more. I mean, this is a leap," says Jung. "This is an agricultural revolution that is ongoing, and artificial intelligence will play the decisive role in it."


Learning through atypical ''phase transitions'' in overparameterized neural networks

arXiv.org Machine Learning

Current deep neural networks are highly overparameterized (up to billions of connection weights) and nonlinear. Yet they can fit data almost perfectly through variants of gradient descent algorithms and achieve unexpected levels of prediction accuracy without overfitting. These are formidable results that escape the bias-variance predictions of statistical learning and pose conceptual challenges for non-convex optimization. In this paper, we use methods from statistical physics of disordered systems to analytically study the computational fallout of overparameterization in nonconvex neural network models. As the number of connection weights increases, we follow the changes of the geometrical structure of different minima of the error loss function and relate them to learning and generalisation performance. We find that there exist a gap between the SAT/UNSAT interpolation transition where solutions begin to exist and the point where algorithms start to find solutions, i.e. where accessible solutions appear. This second phase transition coincides with the discontinuous appearance of atypical solutions that are locally extremely entropic, i.e., flat regions of the weight space that are particularly solution-dense and have good generalization properties. Although exponentially rare compared to typical solutions (which are narrower and extremely difficult to sample), entropic solutions are accessible to the algorithms used in learning. We can characterize the generalization error of different solutions and optimize the Bayesian prediction, for data generated from a structurally different network. Numerical tests on observables suggested by the theory confirm that the scenario extends to realistic deep networks.


How artificial intelligence can help reduce carbon emissions

#artificialintelligence

Artificial intelligence (AI) is scientific intelligence that is mostly used by machines. It involves the use of large data sets of instruction that a computer follows to perform a particular task. The more detailed these instructions are, the more accurate the result. In this article, we will look into how you can use Artificial Intelligence to cut down your carbon emission. To solve a problem with AI, there's a need to approach the problem by thinking about a step-by-step solution.


AI is learning how to create itself

#artificialintelligence

But it's not what the bots are learning that's exciting--it's how they're learning. POET generates the obstacle courses, assesses the bots' abilities, and assigns their next challenge, all without human involvement. Step by faltering step, the bots improve via trial and error. "At some point it might jump over a cliff like a kung fu master," says Wang. It may seem basic at the moment, but for Wang and a handful of other researchers, POET hints at a revolutionary new way to create supersmart machines: by getting AI to make itself. Wang's former colleague Jeff Clune is among the biggest boosters of this idea. Clune has been working on it for years, first at the University of Wyoming and then at Uber AI Labs, where he worked with Wang and others. Now dividing his time between the University of British Columbia and OpenAI, he has the backing of one of the world's top artificial-intelligence labs. Clune calls the attempt to build truly intelligent AI the most ambitious scientific quest in human history.


How AI can be used for good

#artificialintelligence

As an IBM master inventor, professor at UC Irvine, and author of "Own the A.I. Revolution: Unlock Your Artificial Intelligence Strategy to Disrupt Your Competition," Sahota is also a lead artificial intelligence adviser to the United Nations and is helping find ways for AI to provide solutions and prevent future pandemics. Even now, AI is being used to create systems that can impact how treatments for COVID-19 are used. One such AI tool was developed at UC Irvine last year to help predict the probability of patients needing ICU care. This involved collecting the data of patients to get common symptoms of the coronavirus as well as how to accelerate treatment and care options. Other examples include AI-powered walking sticks for the blind, tools to help those who can't speak, and health care apps that use a cell phone to detect diabetes, tuberculosis and skin diseases through the camera and microphone.


Conjunction vs Disjunction: Bad Apples and Other Analogies

#artificialintelligence

Dive into machine learning, and you'll come across algorithms that include conjunctions and disjunctions. For example, you might come across a set of conjunctive rules in a hypothesis space (the set of all functions a model can return) or create a learning algorithm that builds a conjunction using similar features. Conjunctions and Disjunctions are one way to combine propositions into more complex ones. Propositions [noterm] are statements that are either true or false. For example, "2 is greater than 3" or "10 10 21."


Supercharge Knowledge Management With Help From AI

#artificialintelligence

In his 1999 book Management Challenges for the 21 Century, Austrian-born American management consultant, professor, and author Peter Drucker wrote of the importance of "the coordination and exploitation of organizations' knowledge resources, in order to create benefit and competitive advantage." Today, businesses have embraced his point, demonstrating how maintaining and growing an organization's information to assist its employees and customers offer those benefits and advantage. As a practice, this collecting and sharing of information is referred to as knowledge management. Even prior to Drucker's observation, the Consortium for Service Innovation had already begun its work in 1992 on Knowledge-Centered Service (previously known as Knowledge-Centered Support) or KCS *. KCS is a method that focuses on organizational knowledge as a key asset that can benefit, among other things, customer service delivery.